core, eth: improve delivery speed on header requests #23105
Conversation
This may actually make the performance a bit worse in the case that a peer is 3-4 blocks behind us and wants to catch up, requesting a few headers close to our tip (or beyond). Previously, we would have those headers in object form in a header cache, whereas this PR makes them be read from the db. I'll look into it.
Pushed a commit to use the headerchain cache better.
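For illustration only, a minimal sketch of the cache-first idea discussed above; the headerSource interface and its method names are hypothetical stand-ins, not the actual headerchain code:

```go
// A minimal sketch, assuming a hypothetical headerSource abstraction: prefer
// the in-memory cache of decoded headers near the chain tip, and only fall
// back to a raw database read on a cache miss. Illustrative only, not the
// actual headerchain code.
package headercache

import (
	"github.com/ethereum/go-ethereum/common"
	"github.com/ethereum/go-ethereum/core/types"
	"github.com/ethereum/go-ethereum/rlp"
)

// headerSource is a hypothetical stand-in for the header chain.
type headerSource interface {
	cachedHeader(hash common.Hash) *types.Header // decoded header from the cache, or nil
	headerRLP(hash common.Hash) rlp.RawValue     // raw header encoding from the database
}

// headerRLPFor serves a single header in RLP form, touching the database only
// when the header is not already cached in decoded form.
func headerRLPFor(src headerSource, hash common.Hash) (rlp.RawValue, error) {
	if h := src.cachedHeader(hash); h != nil {
		enc, err := rlp.EncodeToBytes(h)
		return rlp.RawValue(enc), err
	}
	return src.headerRLP(hash), nil
}
```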
Triage discussion: probably safe, but we should hold off on this until after the London release.
TODO (not in this PR): add basefee check to sanityCheck
Force-pushed 7cecb9c to 5d34b01
Rebased on top of the eth/66 changes that got merged. This still does not make use of the faster ancient accessors, though.
Force-pushed 5d34b01 to 3d65612
Force-pushed 2897f56 to 8114ca2
Rebased again. Also, tests have been added to check the correctness of the split leveldb/ancient loader.
Force-pushed 8114ca2 to e61b5de
This is now rebased on top of #23566, which should go in first.
Force-pushed e61b5de to c89118f
Rebased + robustified
Force-pushed 77b7f9f to 24948ab
Rebased (Part V)
I tested this a bit on a live node. On my node, I made it
Example:
I then collected some stats about the performance.
It doesn't matter a whole lot which handler was first for serving. Also, no difference between the two responses has been found so far.
After letting it run for a bit longer:
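For illustration, a minimal sketch of the kind of cross-check described above, assuming the old and new handlers can be invoked side by side; headersQuery, serveOld and serveNew are hypothetical names, not real go-ethereum identifiers:

```go
// A minimal sketch of cross-checking two handler implementations against each
// other: serve the same query through both and compare the RLP-encoded
// responses. headersQuery, serveOld and serveNew are hypothetical stand-ins.
package headercheck

import (
	"bytes"
	"log"

	"github.com/ethereum/go-ethereum/common"
	"github.com/ethereum/go-ethereum/rlp"
)

// headersQuery is a simplified header request: Count headers starting at Origin.
type headersQuery struct {
	Origin common.Hash
	Count  uint64
}

// crossCheck serves the same query through both handlers and logs any
// divergence between the RLP-encoded responses.
func crossCheck(q headersQuery, serveOld, serveNew func(headersQuery) []rlp.RawValue) {
	oldResp, newResp := serveOld(q), serveNew(q)
	if len(oldResp) != len(newResp) {
		log.Printf("response length mismatch: old=%d new=%d", len(oldResp), len(newResp))
		return
	}
	for i := range oldResp {
		if !bytes.Equal(oldResp[i], newResp[i]) {
			log.Printf("header %d differs for query %+v", i, q)
		}
	}
}
```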
Force-pushed 24948ab to b80cb55
This PR reduces the amount of work we do when answering header queries, e.g. when a peer is syncing from us. For some items, e.g. block bodies, when we read the RLP data from the database, we plug it directly into the response package. We didn't do that for headers, but instead read the header RLP, decode it to types.Header, and re-encode it to RLP. This PR changes that to keep the data in RLP form as much as possible.

When a node is syncing from us, it typically requests 192 contiguous headers. On master that has the following effect:

- For headers not in ancient: 2 db lookups. One for translating hash->number (even though the request is by number), and another for reading by hash (this latter one is sometimes cached).
- For headers in ancient: 1 file lookup/syscall for translating hash->number (even though the request is by number), and another for reading the header itself. After this, it also performs a hashing of the header, to ensure that the hash is what it expected.

In this PR, I instead move the logic for "give me a sequence of blocks" into the lower layers, where the database can determine how and what to read from leveldb and/or ancients.

There are basically four types of requests; three of them are improved this way. The fourth, by hash going backwards, is more tricky to optimize. However, since we know that the gap is 0, we can look up by the parentHash and still shave off all the number->hash lookups. The gapped collection can be optimized similarly, as a follow-up, at least in three out of four cases.

Co-authored-by: Felix Lange <fjl@twurst.com>

(cherry picked from commit db03faa)
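For illustration only, a minimal sketch of the contiguous-range read described above; readHeaderRLP is a hypothetical stand-in for the database-level accessor that decides between leveldb and ancients, and this is not the merged implementation:

```go
// A minimal sketch of answering a "give me N contiguous headers starting at
// number X" request by passing stored header RLP straight into the response,
// with no decode/re-encode cycle. readHeaderRLP is a hypothetical accessor
// standing in for the layer that picks between leveldb and the ancient store.
package headerserve

import "github.com/ethereum/go-ethereum/rlp"

// readHeaderRLP returns the stored header encoding for a block number, or nil
// if the header is unknown.
type readHeaderRLP func(number uint64) rlp.RawValue

// headerRange collects up to count contiguous header encodings starting at
// from, stopping at the first gap. The raw RLP is forwarded untouched.
func headerRange(read readHeaderRLP, from, count uint64) []rlp.RawValue {
	headers := make([]rlp.RawValue, 0, count)
	for i := uint64(0); i < count; i++ {
		enc := read(from + i)
		if len(enc) == 0 {
			break // unknown header: end of the contiguous run
		}
		headers = append(headers, enc)
	}
	return headers
}
```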